EAST Search History 



Ref 
# 


Hits 


Search Query 


DBs 


Default 
Operator 


Plurals 


Time Stamp 


LI 


18250 


(530/350).CCLS. 


US-PGPUB; 

USPAT; 

DERWENT 


OR 


OFF 


2006/03/02 15:44 


L2 


0 


LI and "c-Cbl associated protein" 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT 


OR 


ON 


2006/03/02 15:44 


L3 


0 


LI and "Cbl associated protein" 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT 


OR 


ON 


2006/03/02 15:45 


L4 


2227 


LI and "CAP" 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT 


OR 


ON 


2006/03/02 15:45 


L6 


1098 


L4 and insulin 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT 


OR 


ON 


2006/03/02 15:46 


L7 


27197 


Oda.in. 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT 


OR 


ON 


2006/03/02 15:46 


L8 


395 


L7 and CAP 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT 


OR 


ON 


2006/03/02 15:47 


L9 


4 


L8 and insulin 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT 


OR 


ON 


2006/03/02 15:46 


L10 


5466 


CAP with (binding protein) 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT 


OR 


ON 


2006/03/02 15:47 


Lll 


264 


"CAP binding protein" 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT 


OR 


ON 


2006/03/02 15:47 


L12 


78 


Lll and insulin 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT 


OR 


ON 


2006/03/02 15:47 



3/2/06 3:48:21 PM 

C:\Documents and Settings\SMayer\My Documents\EAST\Workspaces\CAP binding protein. wsp 
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untitled 

242 GAGCTGGCGTTGCTCCCGCCACCGCCGCCGCCGCCGCCGACTCCCGCGACCCCGACGTCC 301 

Db 302 tcggcgtccmcctggacctgggcgagcagcgggagcgctgUagacgttccagaa^ 361 

Qy 122 GlriLysLeuThrSerGluGlyAlaAlaLysLeuLeuLeuAspThrPheGluTyrGlnGly 141 

Db 362 CAGAAGCTTACCTCCGAGGGTGCCGCCAAGCTCCTGCTAGACACCTTTGAATACCAGGGC 421 

Db 422 CTGGTGAAGCACACAGGAGGCTGCCACTGTGGAGCAGTTC^ 481 

Qy 162 AlaAspLeuHi'sllePheAspCys 169 

AO II I II I II I MM II 1 1 II III II 

Db 482 GCAGACTTGC ATATATTTGACTGC 505 
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SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



Homo sapiens (human) titled 
Homo sapiens 

ESfflI? ; E!thS?I: Eu h ar r ch^n^ mata; Yertebrata; Euteleostomi • 
Hominidae; Homo ' Euarchon t°9li res; Primates; Catarrhini; ' 

1 (bases 1 to 1097) 

SXfe"-' K ™**""W.A.N.. Zhang,*., chait.B.T. and 

2 (bases 1 to 1097) 

SS£Wij& Maturris '«->- 

Baltimore, MD 21205, USA ruDMC Health, 615 N. Wolfe St., 

Location/Qualifiers 
1. .1097 

/orgarn'sm="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon:9606" 

/clone="iMAGE: 3936128" 
<1. .817 

'/n&SSt* " the " Uclear P^ery" 

/E™2 U - t= "[J u S lear Protein p30" 
/protein_id="AAM76703.1" 
/db_xref ="gi : 21717632 " 



ORIGIN 

Alignment Scores: 

Pred. No.: 2.72e-142 

Score: . ., . 168.00 

Percent similarity: 100.00% 
Best Local Similarity: 100.00% 

Query Match: 96.00% 

DB: 8 



Length: 

Matches: 

Conservative: 

Mismatches: 

Indels: 

Gaps: 



US-10-530-886-2 (1-175) x AF514992 (1-1097) 



1097 

168 

0 

0 

0 

0 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



2 CGGCGATC<WGGAGCTCTGCGGCCGCCAAGCT^GCGGGCAGAA „ 



er 21 



62 



122 TCCGCTAGCCAGGCCGGGAGCAAGAGCWGGCGOTGGAG^ m 

* 2 1 1 n? vtr?t r??rn i r rr?t r ?t7 r? J Tf i v?7 v?i rr r?? ?y?i r?? r?"™— « 

182 CGGCTGAGGCGCTCGTCGCCGCGGGCCCAGGAGGAGGGCCCGGGGGAGCCGCCGCCGCCT 241 

82 < ; ""-^Al«-euL e uProProProProProProProProThrProAUThrProThr S er 101 

Page 8 



JOURNAL 

FEATURES 

source 



ORIGIN 



.. untitled 
diseases 

Patent: WO 2005019258-A 4392 03-MAR-2005; 
Genentech, inc. (us) 

Locati on/Qual i f i ers 

1. .1097 

/organism="Homo sapiens" 
/mol_type="unassigned dna" 
/db_xref="taxon:9606" 



Alignment Scores: 

Pred. No.: 2.72e-142 

Score: 168.00 

Percent Similarity: 100.00% 
Best Local Similarity: 100.00% 

Query Match: ' 96.00% 

DB: 6 



Length: 
Matches : 
Conservative: 
Mismatches: 
Indels: 
Gaps : 



1097 

168 

0 

0 

0 

0 



US-10-530-886-2 (1-175) x CS043838 (1-1097) 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

D 



CGGCGATCGAGGAGCTCTGCGGCCGCCAAGCTGCGCGGGCAGAAGCGGTCCGGGGCCT 



CC 61 



22 Al aAl aProAl aAl aSerAl aAl aAl aAl aLeuAl aProserAl aThrAraThrArnarn 



41 




62 oc^i^i^^^i^^^^^^^^^^^^i^^^Mm in 

122 TCCGCTAGCCAGGCCGGGAGCAAGAGCCAGGCGGTGGAGAAGCCGCCGTCGGAGAAGCCG 181 

62 ArgLeuArgArgSerserProArgAlaGlnGluGluGlyproGlyGluProProProPro 81 
, B I III II I III 111 I MINIM 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 if 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

182 CGGCTGAGGCGCTCGTCGCCGCGGGCCG^GGAGGAGGGCCCGGGGGAGCCGCCGCCG^ 241 
82 9lV L ?" A l aL ? uLeuProPr °P' , oProProProProProThrProAlaThrProThrSer mi 

302 TCGGCGTCCAACCTGGACCTGGGCGAGCAGCGGGAGCGCTGGGAGACGTTCCAGAAGCGG 361 
362 CAGMGCTTACCTCCGAGGGTGCCGCCAAGCTCCTGCTAGACACC^ 421 

422 CTGGTGMGCACACAGGAGGCTGCCACT^ 481 

162 AlaAspLeuHisllePheAspcys 169 
A „ I 1 I 1 1 I I I I I I I I I I | | | I I T I T I 
482 GCAGACTTGCATATATTTGACTGC 505 



AF514992. 1097 bp mRNA linear 

A2c?/iQo? iens nuclear Protein p30 mRNA, partial cds. 

GI: 21717631 
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PRI 16-SEP-2002 



AF514992 
AF514992.1 



ORIGIN 

Alignment Scores: 

Pred. No.: 2.72e-142 

Score: 168.00 

Percent similarity: 100.00% 
Best Local similarity: 100.00% 

Query Match: 96.00% 

DB: 6 



Untitled 



Length: 

Matches: 

conservative: 

Mismatches: 

indels: 

Gaps: 



1097 

168 

0 

0 

0 

0 



US-10-530-886-2 (1-175) x CS043836 (1-1097) 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



2 ArgArgSerArgSerSerAlaAlaAlaLysLeuArgGlyGlnLysArgSerGlyAlaSer 21 
IIIIITIIIIITIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIITIMIIIIIIIII 

2 CGGCGATCGAGGAGCTCTGCGGCCGCCAAGCTGCGCGGGCAGAAGCGGTCCGGGGCCTCC 61 
22 AlaAlaProAlaAlaSerAlaAlaAlaAlaLeuAlaProSerAlaThrArgThrArgArg 41 

I MM MMIMMMMMI MM MM MM II MIMI MM MM 111 I Mil Ml 

6 2 GCGGCCCCCGCGGCCTCCGCGGCCGCTGCCTTGGCACCCAGCGCCACCCGCACACGGCGC 12 1 



42 
122 
62 



SerAlaSerGlnAlaGlySerLysSerGlnAlaValGluLysProProSerGluLysPro 61 
I I 1 1 I I 1 1 1 I I 1 1 1 1 1 1 1 1 I I I 1 1 I 1 1 I I 1 1 1 1 i I 1 1 1 I 1 1 1 1 1 I 1 1 I I I 1 1 I 1 1 I I I I I 

TCCGCTAGCCAGGCCGGGAGCAAGAGCCAGGCGGTGGAGAAGCCGCCGTCGGAGAAGCCG 18 1 



ArgLeuArgArgSerSerProArgAl aGl nGl uGl uGl yProGlyGl uProProProPro 81 
Mill Mil IMIIIIIIIMIMI Illlllll MM Illlllll MM I III MM I II 

182 CGGCTGAGGCGCTCGTCGCCGCGGGCCCAGGAGGAGGGCCCGGGGGAGCCGCCGCCGCCT 241 



82 GluLeuAlaLeuLeuProProProProProProProProThrProAlaThrProThrSer 
I I M I I ! I 1 I II I I 1 I I I M I I I I 1 1 I I 1 1 I 1 1 1 M 1 1 M I I 1 I I I i I 1 1 II I I i I M I I 

242 GAGCTGGCGTTGCTCCCGCCACCGCCGCCGCCGCCGCCGACTCCCGCGACCCCGACGTCC 



102 
302 
122 
362 
142 
422 
162 
482 



SerAl aSerAsnLeuAspLeuGlyGl uGl nArgGl uArgTrpGl uThrPheGl nLysArg 
I f I I I ! 1 1 I I I E I I I I I ! I 1 E I I I i I T I I I I I T I I I I I I I I j I I ! I I I T 

TCGGCGTCCAACCTGGACCTGGGCGAGCAGCGGGAGCGCTGGGAGACGTTCCAGAAGCGG 



101 
301 
121 
361 



Gl nLysLeuThrSerGl uGlyAl aAl aLysLeuLeuLeuAspThrPheGl uTyrGl nGly 141 
MIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 

CAGAAGCTTACCTCCGAGGGTGCCGCCAAGCTCCTGCTAGACACCTTTGAATACCAGGGC 42 1 

Leuval LysHi sThrGlyGlyCysHi sCysGlyAl aval ArgPheGl uValTrpAl aser 161 
IMIIMII llllll MM Mil IIIIIIMIIII MM Mil III I llllllll Mill 

CTGGTGAAGCACACAGGAGGCTGCCACTGTGGAGCAGTTCGTTTTGAAGTTTGGGCCTCA 481 

AlaAspLeuHisllePheAspCys 169 

1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GCAGACTTGCATATATTTGACTGC 505 



RESULT 4 
CS043838 
LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 



CS043838 1097 bp DNA 

Sequence 4392 from Patent WO2005019258. 
CS043838 

CS043838.1 Gl: 61850673 



linear PAT 22-MAR-2005 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae; Homo. 
1 

Abbas, A., Clark, H., Ouyang.W., Williams, P.M. , Wood, W.I. and Wu.T.D. 
Compositions and methods for the treatment of immune related 
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OB: 



untitled 
Gaps: 



US-10-530-886-2 (1-175) x CS034886 (1-1097) 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

oy 

Db 



2 ArgArgserArgserserAlaAlaAlaLysLeuArgGlyGlnLysArgSerGlyAlaSer 21 
llTllTlllllTllllll llllllllTlllllllllllTllllllllllll ci 

2 CGGCGATCGAGGAGCTCTGCGGCCGCCAAGCTGCGCGGGCAGAAGCGGTCCGGGGCCTCC 61 

22 AlaAlaProAlaAlaSerAlaAlaAlaAlaLeuAlaProSerAlaThrArgThrArgArg 41 
M 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 II 1 1 1 1 1 1 M 1 1 II 1 1 Mil I 1 1 IIIlT 

62 GCGGCCCCCGCGGCCTCCGCGGCCGCTGCCTTGGCACCCAGCGCCACCCGCACACGGCGC 12 1 

42 Se rAl ase rGl nAl aGl yse rLysSe rGl nAl aval Gl uLys P ro ProSe rGl uLys P ro 61 
lllllllllllllllimillllllllll Illllllllllllllllllllll 

122 TCCGCTAGCCAGGCCGGGAGCAAGAGCCAGGCGGTGGAGAAGCCGCCGTCGGAGAAGCCG 181 

62 ArqLeuArgArgSerSerProArgAlaGlnGluGluGlyProGlyGluProProProPro 81 
IITIIIIITIITIIIIIIIIIIITIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIII 

182 CGGCTGAGGCGCTCGTCGCCGCGGGCCCAGGAGGAGGGCCCGGGGGAGCCGCCGCCGCCT 241 

82 GluLeuAlaLeuLeuProProProProProProProProThrProAlaThrProThrSer 101 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II I M 

242 GAGCTGGCGTTGCTCCCGCCACCGCCGCCGCCGCCGCCGACTCCCGCGACCCCGACGTCC 301 

102 SerAl aSerAsnLeuAspLeuGlyGl uGl nArgGl uArgTrpGl uThrPheGl nLysArq 121 

lllllllllllllllllllllllllllM 
302 TCGGCGTCCAACCTGGACCTGGGCGAGCAGCGGGAGCGCTGGGAGACGTTCCAGAAGCGG 361 

122 Gl nLysLeuThrSerGl uGlyAl aAl aLysLeuLeuLeuAspThrPheGl uTy rGl nGly 141 

I i I 1 1 I I I I I I I 1 I 1 I I I I 1 1 I I 1 1 1 I I 1 I I 1 I I 1 1 1 I I I I 1 I 1 I I I I 1 I 1 I I I I I 1 t I I 
362 CAGAAGCTTACCTCCGAGGGTGCCGCCAAGCTCCTGCTAGACACCTTTGAATACCAGGGC 421 

142 Leuval LysHi sThrGl yGlyCysHi sCysGl yAl aval ArgPheGl uValTrpAl aSer 161 
IIMIIIllMIIIIMlimlllMm /C1 

422 CTGGTGAAGCACACAGGAGGCTGCCACTGTGGAGCAGTTCGTTTTGAAGTTTGGGCCTCA 481 

162 AlaAspLeuHisIlePheAspCys 169 
1 1 1 1 II M 1 1 1 1 M M M M 1 1 1 1 pnp 

482 GCAGACTTGCATATATTTGACTGC 505 



RESULT 3 
CS043836 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

FEATURES 

source 



CS043836 1097 bp DNA 

Sequence 4390 from Patent WO2005019258. 
CS043836 

CS043836.1 Gl:61850672 



linear PAT 22-MAR-2005 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Cramata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontogli res ; Primates; Catarrhim; 
Hominidae; Homo. 

Abbas, A., Clark, H., Ouyang,W. , Williams, P.M. , Wood, W.I. and Wu.T.D. 
Compositions and methods tor the treatment of immune related 
diseases 

Patent: WO 2005019258-A 4390 03-MAR-2005; 
Genentech, Inc. (us) 

Locati on/Qual i f i e rs 

1. .1097 

/organism="Homo sapiens" 
/mol_type= M unassigned DNA" 
/db_xref="taxon:9606" 

Page 5 



Qy 22 
Db 

Qy 

Db 

Qy 62 
Db 182 
Qy 82 
Db 242 
Qy 102 
Db 
Qy 
Db 

Qy 

Db 

Qy 162 
Db 482 



Untitled 

Al aAl aProAl aAl aSerAl aAl aAl aAl aLeuAl aProSerAl aThrArgThrArgArg 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II III I Ml 1 1 II II I III 1 1 1 II II IIIIIITIIIMIIII 



41 



62 GCGGCCCCCGCGGCCTCCGCGGCCGCTGCCTTGGCACCCAGCGCCACCCGCACACGGCGC 12 1 

42 SerAl aSerGl nAl aGlyserLysSerGl nAl aval Gl uLysProProSerGl uLysPro 61 
Ml MINIM II III MM III III I MINIMI MM 1 1 III II III MM MM II , M 

12 2 TCCGCTAGCCAGGCCGGGAGCAAGAGCCAGGCGGTGGAGAAGCCGCCGTCGGAGAAGCCG 181 

ArgLeuArgArgSerSer ProArgAl aGl nGl uGl uGl yProGl yGl uProProProPro 81 
NT II I NT N IN 1 1 1 1 1 1 1 1 III N 1 1 1 II 1 1 1 N N I N 1 1 1 II 1 1 II II II 1 1 1 N _ 

CGGCTGAGGCGCTCGTCGCCGCGGGCCCAGGAGGAGGGCCCGGGGGAGCCGCCGCCGCCT 241 

Gl uLeuAl aLeuLeuProProProProProProProProThrProAl aThrProThrSer 101 

N I I I II II I I I II I II I I II I II I I I II II I I I II I II II II I II I II I I I II I II I I I _ 
GAGCTGGCGTTGCTCCCGCCACCGCCGCCGCCGCCGCCGACTCCCGCGACCCCGACGTCC 301 

SerAl aSerAsnLeuAspLeuGl yGl UGl nArgGl uArgTrpGl uThrPheGl nLysArq 121 
M M 1 1 M 1 1 M M M MM Ml II 1 1 M INT IMINNI II II 1 1 M 1 1 1 1 1 1 1 1 1 1 

302 TCGGCGTCCAACCTGGACCTGGGCGAGCAGCGGGAGCGCTGGGAGACGTTCCAGAAGCGG 361 

122 Gl nLysLeuThrSerGl uGlyAl aAl aLysLeuLeuLeuAspThrPheGl uTyrGl nGly 141 

I I I 1 1 I I I I 1 I I I I I I 1 1 I 1 1 1 1 1 I I I I 1 I I I I I I I 1 I I I I I I 1 I I I I I I I I I 1 1 I 1 1 1 I m 
362 CAGAAGCTTACCTCCGAGGGTGCCGCCAAGCTCCTGCTAGACACCTTTGAATACCAGGGC 421 

142 LeuVal LysHi sThrGlyGl ycysHi sCysGlyAl aval ArgPheGl uValTrpAl aSer 161 
M 1 1 1 Ml I M 1 1 1 1 1 II M I II 1 1 M 1 1 M I M M M 1 1 IT 1 1 M M M I M M M I M ^ 

422 CTGGTGAAGCACACAGGAGGCTGCCACTGTGGAGCAGTTCGTTTTGAAGTTTGGGCCTCA 481 

AlaAspLeuHisllePheAspCys 169 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 i 1 1 1 r r 

GCAGACTTGCATATATTTGACTGC 505 



RESULT 2 
CS034886 
LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

FEATURES 

source 



CS034886 1097 bp DNA 

Sequence 4392 from Patent WO2005016962 . 
CS034886 

CS034886.1 Gl:60733238 



linear PAT 10-MAR-2005 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; craniata; Vertebrata; Euteleostorm ; 
Mammalia; Eutheria; Euarchontogli res; Primates; Catarrhini; 
Hominidae; Homo. 

Abbas, A., Clark, H., Ouyanq,w. , Williams, M. P. , Wood,W.I. and Wu,T.D. 
compositions and methods for the treatment of immune related 
diseases 

Patent: WO 2005016962-A 4392 24-FEB-2005; 
Genentech, Inc. (US) 

Locati on/Qual i f i e rs 

1. .1097 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon:9606" 



ORIGIN 



Alignment scores: 

Pred. No.: 2.72e-142 

Score: 168.00 

Percent Similarity: 100.00% 

Best Local similarity: 100.00% 

Query Match: 96.00% 



Length: 1097 

Matches: 168 

Conservative: 0 

Mismatches: 0 

Indels: 0 
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c 
c 
c 



c 
c 
c 



c 
c 
c 
c 



56 


12 


6.9 


188633 


57 


12 


6.9 


193411 


58 


12 


6.9 


200568 


59 


12 


6.9 


201389 


60 


12 


6.9 


202932 


61 


12 


6.9 


204134 


62 


12 


6.9 


207105 


63 


12 


6.9 


210145 


64 


12 


6.9 


210969 


65 


12 


6.9 


216290 


66 


12 


6.9 


224054 


67 


12 


6.9 


230871 


68 


12 


6.9 


230951 


69 


12 


6.9 


235703 


70 


12 


6.9 


244853 


71 


12 


6.9 


253961 


72 


12 


6.9 


254581 


73 


12 


6.9 


256496 


74 


12 


6.9 


265797 


75 


12 


6.9 


271069 



Untitled 
9 AL606932 
14 BX813304 
14 AC093318 
14 AC093358 
9 AC153890 
9 AC145116 
14 AC123332 
5 CR387996 
9 AC139758 
14 AC151275 
14 AC097089 
14 AC162361 
14 AC120683 
9 AC113970 
14 AC103346 
14 AC128235 
14 AC111674 
9 AC158239 
9 AC099884 
14 AC127915 



AL606932 Mouse DNA 
BX813304 Danio rer 
AC093318 Mus muscu 
AC093358 Mus muscu 
AC153890 Mus muscu 
AC145116 Mus muscu 

AC123332 Rattus no 
CR387996 Zebrafish 
AC139758 Mus muscu 
AC151275 Mus muscu 
AC097089 Rattus no 
AC162361 Mus muscu 
AC120683 Rattus no 
AC113970 Mus muscu 
AC103346 Rattus no 
AC128235 Rattus no 
AC111674 Rattus no 
AC158239 Mus muscu 
AC099884 Mus muscu 
AC127915 Rattus no 



ALIGNMENTS 



RESULT 1 
CS034884 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

FEATURES 

source 



ORIGIN 



CS034884 1097 bp DNA linear PAT 10-MAR-2005 

Sequence 4390 from Patent WO2005016962 . 

CS034884 

CS034884.1 GH60733237 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; Euteleostomi : 
Mammalia; Euthena; Euarchontogli res; Primates; Catarrhini: 
Homimdae; Homo. 
1 

Abbas, A., Clark,H., Ouyang,w., Williams, M. P. , Wood, W.I. and Wu.T.D. 
Compositions and methods for the treatment of immune related 
diseases 

Patent: WO 2005016962-A 4390 24-FEB-2005; 
Genentech, Inc. (US) 

Locati on/Qual i f i ers 

1. .1097 

/orqanism="Homo sapiens" 
/mol_type="unassigned DNA n 
/db_xref="taxon:9606" 



Alignment Scores: 

Pred. No.: 2.72e-142 

Score: 168.00 

Percent similarity: 100.00% 
Best Local similarity: 100.00% 

Query Match: 96.00% 

DB: 6 



Length: 

Matches: 

Conservative: 

Mismatches: 

Indels: 

Gaps: 



1097 

168 

0 

0 

0 

0 



US-10-530-886-2 (1-175) x CS034884 (1-1097) 



Qy 

Db 



2 AraArqSerArqSerSerA^ 21 

, I I T I I III I I III I I I I I I I I lllllll Mil | | | || | | | | III | | | II I I I I I I 

2 CGGCGATCGAGGAGCTCTGCGGCCGCCAAGCTGCGCGGGCA^ 61 
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Untitled 

fn2 r ?e 9 !i ea t er V : u an or n ec iy a " 1 S° the score of the result being printed, 
and is denved by analysis of the total score distribution. 



SUMMARIES 



Result Query 

No. score Match Length DB ID 



Description 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 
41 
42 
43 
44 
45 
46 
47 
48 
49 
50 
51 
52 
53 
54 
55 



168 


96.0 1097 


168 


96.0 1097 


168 


96.0 1097 


168 


96.0 1097 


168 


96.0 1097 


152 


86.9 1191 


136 


77.7 161943 


136 


77.7 192307 


119 


68.0 170944 


109 


62.3 926 


84 


48.0 1797 


84 


48.0 1797 


72 


41.1 1005 


50 


28.6 579 


45 


25.7 723 


33 


18.9 181719 


33 


18.9 195339 


33 


18.9 322497 


30 


17.1 300 


27 


15.4 147951 


19 


10.9 193129 


19 


10.9 206722 


17 


9.7 2602 


13 


7.4 572 


13 


7.4 744 


13 


7.4 110001 


13 


7.4 349142 


12 


6.9 224 


12 


6.9 560 


12 


6.9 644 


12 


6.9 741 


12 


6.9 911 


12 


6.9 1203 


12 


6.9 2100 


12 


6.9 2510 


12 


6.9 3026 


12 


6.9 6297 


12 


6.9 62850 


12 


6.9 63451 


12 


6.9 79272 


12 


6.9 110000 


12 


6.9 110000 


12 


6.9 135378 


12 


6.9 143589 


12 


6.9 147009 


12 


6.9 150502 


12 


6.9 156517 


12 


6.9 157110 


12 


6.9 160562 


12 


6.9 161012 


12 


6.9 170871 


12 


6.9 175754 


12 


6.9 182841 


12 


6.9 185174 


12 


6.9 185321 



6 CS034884 
6 CS034886 
6 CS043836 
6 CS043838 
8 AF514992 
8 BC052604 
8 HS84M10 
8 AC005971 
8 AC093484 
8 HSA332538 
6 CS036951 
6 CS045903 
6 AR447632 
6 CQ724012 

8 HSA330089 
14 AL662863 

9 AL596181 
14 AC112109 
6 BD220196 
14 AC143662 
5 AL953908 

5 CR848813 
2 AK112512 
5 CR389739 
5 CR352613 

14 AC151687 
1 BX572599 

15 AY022843 

10 BV223329 
15 AY429384 
15 AF345338 
15 AF447888 
15 AB091677 
15 AB071299 
5 BC098960 
15 AK100322 
9 AF227194 

14 AC100009 
8 AL662886 

15 AC002338 

15 AP008212_281 
15 AP008212_282 

8 AL390060 

9 AC124745 

14 AC084179 
9 AC127563 

15 AP005395 

8 AL929101 

9 AC164299 
14 AP0042 77 

14 AC084135 

15 AP003766 
5 BX248326 
9 AC136004 
8 AC123908 
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CS034884 Sequence 
CS034886 Sequence 
CS043836 Sequence 
CS043838 Sequence 
AF514992 Homo sapi 
BC052604 Homo sapi 
AL 3 5 6460 Homo sapi 
AC005971 Homo sapi 
AC093484 Homo sapi 
AJ332538 Homo sapi 
CS036951 Sequence 
CS045903 Sequence 
AR447632 Sequence 
CQ724012 Sequence 
A3 3 30089 Homo sapi 

AL662863 Mus muscu 
AL596181 Mouse DNA 
AC112109 )Rattus no 
BD220196 Human gen 
AC143662 Macaca mu 
AL953908 Zebrafish 
CR848813 Zebrafish 
AK112512 Ciona int 
CR389739 Gall us ga 
CR352613 Callus ga 
AC151687 Gall us ga 
BX572599 Rhodopseu 
AY022843 Oryza sat 
BV223329 S233P6545 
AY429384 Arabidops 
AF345338 Arabidops 
AF447888 Arabidops 
AB091677 Peri pi oca 
AB071299 Oryza sat 
BC098960 Xenopus 1 
AK100322 Oryza sat 
AF227194 Mus muscu 

AC100009 Mus muscu 
AL662886 Human DNA 
AC002338 Arabidops 
Continuation (282 
Continuation (283 
AL390060 Human DNA 
AC124745 Mus muscu 
AC084179 Homo sapi 
AC127563 Mus muscu 

AP005395 Oryza sat 
AL929101 Human DNA 
AC164299 Mus muscu 
AP004277 oryza sat 
AC084135 Homo sapi 
AP003766 oryza sat 
BX248326 Zebrafish 
AC136004 Mus muscu 
AC123908 Homo sapi 




Untitled 

Gencore version 5.1.6 
Copyright (c) 1993 - 2006 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: January 14, 2006, 00:57:07 ; Search time 3092 Seconds 

(without alignments) 
3217.210 Million cell updates/sec 

US-10-530-886-2 
175 

1 MRRSRSSAAAKLRGQKRSGA FEVWASADLHI FDCKYRN YI 175 



Title: 

Perfect score: 
Sequence: 

scoring table: 



OLIGO 

Xgapop 60.0 
Ygapop 60.0 
Fgapop 6.0 
Delop 6.0 



Xgapext 60.0 
Ygapext 60.0 
Fgapext 7.0 
Delext 7.0 



Searched: 
Word size: 



5883141 seqs, 28421725653 residues 
1 



11757827 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Listing first 75 summaries 

Command line parameters: 
-MODEL=f rame+_p2n. model -DEV=xl p 

-Q=/cgn2_l/USPTO_spool_p/USl0530886/runat_12012006_070942_6072/app_query.fasta_1.654 

-DB=GenEmbl -QFMT=fastap -SUFFIX=oligo. rge -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bitS -START=1 -END=-1 -MATRIX=ol i go -TRANS=human40 . cdi -LIST=75 
-DOCALIGN=200 -THR_SCORE=qual i ty -THR_MIN=1 -ALIGN=50 -MODE=LOCAL -OUTFMT=ptO 
-NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 

-USER=USl05308863CGN_l_l_77243runat^l2012006_070942.6072 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 
-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=60 -XGAPEXT=60 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP=60 -YGAPEXT=60 -DELOP=6 -DELEXT=7 



Database 



GenEmbl : * 



3 
4 
5: 
6: 
7: 
8: 
9i 
10: 
11: 
12: 
13: 
14: 
15: 



gb_ba:* 
gb_in:* 
gb_env:* 
gb_om:* 
gb_ov:* 
gb_pat:* 
gb_ph:* 
gb_pr:* 
gb_ro:* 
gb_sts:* 
gb_sy:* 
gb_un:* 
gb_vi :* 
gb_htg:* 
gb_pl : * 



Pred. No. is the number of results predicted by chance to have a 
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RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer OF., Bhat N.K., 

RA Hopkins R.F., Jordan H. f Moore T., Max S.I., Wang 3., Hsieh F., 

RA Diatchenko L., Marusina K., Farmer A. A., Rubin G.M., Hong L., 

RA stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange c. f 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R. D. y Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards s., Worley K.C., Hale s., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., sodergren E.D., Lu X., Gibbs R.A. , 

RA Fahey 3 . , Helton E. f Ketteman M. f Madan A., Rodrigues s. f Sanchez A. f 

RA Whiting M., Madan A. f Young A.C., shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.c, Grimwood J., Schmutz 3 . , Myers R.M., 

RA Butterfield Y.S.N.. , Krzywinski M.I., skalska U. f Smailus D.E., 

RA Schnerch A., schein D.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Blood; 

RA Director MGC Project; 

RL Submitted (MAY-2003) to the EMBL/GenBank/DDBJ databases. 

DR EMBL ; BC052604; AAH52604.1; -; mRNA. 

DR Ensembl; ENSG00000166582 ; Homo sapiens. 

DR HGNC; HGNC: 29920; PRR6. 

DR GO; G0:0016846; F: carbon-sulfur lyase activity; IEA. 

DR GO; GO:0008152; P:metabolism; IEA. 

DR interPro; IPR006913; GFA. 

DR Pfam; PF04828; DUF636; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 275 AA; 29946 MW; E9A69F1F9BA746D8 CRC64; 

Query Match 96.6%; Score 172; DB 2; Length 275; 

Best Local Similarity 100.0%; Pred. No. 3.6e-141; 

Matches 172; conservative 0; Mismatches 0; Indels 0; Gaps 0 
Qy 1 MRRSRSSAAAKLRGQKRSGASGASAAPAASAAAALAPSATRTRRSASQAGSKSQAVEKPP 60 

I I I I I I I I I I I I I I I M I 1 1 I 1 1 I 1 1 1 1 1 I I 1 1 1 I 1 1 I I II I I I I I I I I 1 1 I 1 1 II I 1 1 I 

Db 1 MRRSRSSAAAKLRGQKRSGASGASAAPAASAAAALAPSATRTRRSASQAGSKSQAVEKPP 60 

Qy 61 SEKPRLRRSSPRAQEEGPGEPPPPELALLPPPPPPPPTPATPTSSASNLDLGEQRERWET 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 SEKPRLRRSSPRAQEEGPGEPPPPELALLPPPPPPPPTPATPTSSASNLDLGEQRERWET 120 

Qy 121 FQKRQKLTSEGAAKLLLDTFEYQGLVKHTGGCHCGAVRFEVWASADLHIFDC 172 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 FQKRQKLTSEGAAKLLLDTFEYQGLVKHTGGCHCGAVRFEVWASADLHIFDC 172 
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30 


10 


5.6 


225 


2 


31 


10 


5.6 


230 


2 


32 


10 


5.6 


230 


2 


33 


10 


5.6 


232 


1 


34 


10 


5.6 


266 


2 


35 


10 


5.6 


273 


2 


36 


10 


5.6 


279 


2 


37 


10 


5.6 


282 


2 


38 


10 


5.6 


283 


2 


39 


10 


5.6 


294 


2 


40 


10 


5.6 


300 


2 


41 


10 


5.6 


304 


1 


42 


10 


5.6 


318 


2 


43 


10 


5.6 


331 


2 


44 


10 


5.6 


356 


2 


45 


10 


5.6 


373 


1 


46 


10 


5.6 


396 


2 


47 


10 


5.6 


398 


1 


48 


10 


5.6 


398 


2 


49 


10 


5.6 


412 


2 


50 


10 


5.6 


421 


1 


51 


10 


5.6 


421 


2 


52 


10 


5.6 


427 


2 


53 


10 


5.6 


457 


2 


54 


10 


5.6 


461 


2 


55 


10 


5.6 


468 


2 


56 


10 


5.6 


472 


2 


57 


10 


5.6 


476 


2 


58 


10 


5.6 


476 


2 


59 


10 


5.6 


478 


2 


60 


10 


5.6 


489 


2 


61 


10 


5.6 


495 


2 


62 


10 


5.6 


497 


2 


63 


10 


5.6 


519 


2 


64 


10 


5.6 


545 


2 


65 


10 


5.6 


585 


2 


66 


10 


5.6 


592 


2 


67 


10 


5.6 


618 


2 


68 


10 


5.6 


618 


2 


69 


10 


5.6 


666 


2 


70 


10 


5.6 


666 


2 


71 


10 


5.6 


670 


2 


72 


10 


5.6 


670 


2 


73 


10 


5.6 


672 


2 


74 


10 


5.6 


687 


2 


75 


10 


5.6 


698 


2 



Untitled 
Q84NL2_ORYSA 
Q7XV57_ORYSA 
Q84275_HPV47 
ACROL_HUMAN 
Q52MA4_XENTR 
Q5ZBY8_ORYSA 
Q5C8Q0_9ADEN 
Q8BRU0_MOUSE 
Q6EPA2_ORYSA 
Q8N144_HUMAN 
Q4THH6_TETNG 
VE4_HPV47 
Q53JG2_ORYSA 
Q6P0G2_BRARE 
Q8LIR8_ORYSA 
FBLI1_HUMAN 
Q6NTF6_HUMAN 
SP5_MOUSE 
Q6BEB4_HUMAN 
Q5CAF6_ORYSA 
ACRO_HUMAN 
Q6ICK2_HUMAN 
Q6ETL2_ORYSA 
Q52KK4_RAT 
Q5BJ2LBRARE 
Q6P94LBRARE 
Q5U4A3_XENLA 
Q9SCT7_ARATH 
Q5ZKA6_CHICK 
Q51TL5_MAGGR 
Q86XT7_HUMAN 
Q6C8G0_YARLI 
Q54LW2_DICDI 
04865 5_ORYSA 
Q6YS18_0RYSA 
Q6RJN6_PHYPA 
Q6RJN7_PHYPA 
Q8S6K9_ORYSA 
Q99KV8_MOUSE 
Q7TVG2_MYCBO 
O69740_MYCTU 
Q8RVE5_ORYSA 
Q593S5_CYPCA 
Q8l_MH2_ORYSA 
Q8BKD3_MOUSE 
Q7XF73_ORYSA 



Q84nl2 
Q7xv57 
Q84275 
P58840 
Q52ma4 
Q5zby8 
Q5c8q0 
Q8bru0 
Q6epa2 
Q8nl44 
Q4thh6 
P22421 
Q53jg2 
Q6p0g2 
Q81ir8 
Q8wup2 
Q6ntf6 
Q9ihx2 
Q6beb4 
Q5caf6 
P10323 
Q6ick2 
Q6etl2 
Q52kk4 
Q5bj21 
Q6p941 
Q5u4a3 
Q9sct7 
Q5zka6 
Q51tl5 
Q86xt7 
Q6c8g0 
Q54lw2 
048655 
Q6ysl8 
Q6rj"n6 
Q6rjn7 
Q8s6k9 
Q99kv8 
Q7tvg2 
069740 
Q8rve5 
Q593s5 
Q8lmh2 
Q8bkd3 
Q7xf73 



oryza sativ 
oryza sativ 
human papil 
homo sapien 
xenopus tro 
oryza sativ 
simian aden 
mus musculu 
oryza sativ 
homo sapien 
tetraodon n 
human papil 
oryza sativ 
brachydanio 
oryza sativ 
homo sapien 
homo sapien 
mus musculu 
homo sapien 
oryza sativ 
homo sapien 
homo sapien 
oryza sativ 
rattus norv 
brachydanio 
brachydanio 
xenopus lae 
arabidopsis 
gall us gall 
magnaporthe 
homo sapien 
yarrowia li 
dictyosteli 
oryza sativ 
oryza sativ 
physcomi tre 
physcomitre 
oryza sativ 
mus musculu 
mycobacteri 
mycobacteri 
oryza sativ 
cyprinus ca 
oryza sativ 
mus musculu 
oryza sativ 



ALIGNMENTS 



RESULT 1 
Q7Z7K6_HUMAN 

ID Q7Z7K6_HUMAN PRELIMINARY; PRT; 275 AA. 

AC Q7Z7K6; 

DT 01-OCT-2003 (TrEMBLrel . 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-MAR-2004 (TrEMBLrel. 26, Last annotation update) 

DE Hypothetical protein. 

GN • Name=PRR6; 

OS Homo sapiens (Human). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

oc Mammalia; Eutheria; Euarchontogli res ; Primates; Catarrhini; Hominidae; 

OC Homo . 

OX NCBl_TaxlD=9606; 
RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Blood; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10. 1073/pnas . 242603899; 

RA strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., shenmen CM. , Schuler G.D., 
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untitled 



copyright 



Gencore version 
(c) 1993 - 2006 



5-1.6 
Compugen 



Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 
Word size 



January 12, 2006, 12:44:40 ; Search time 116.986 Seconds 

(without alignments) 
1073.498 Million cell updates/sec 

US-10-530-886-4 
178 

1 MRRSRSSAAAKLRGQKRSGA FE VWASADLHI FDCKYRN YI 178 

OLIGO 

Gapop 60.0 , Gapext 60.0 
2166443 seqs, 705528306 residues 
0 



Total number of hits satisfying chosen parameters: 2166443 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Listing first 75 summaries 

Database : UniProt_05 . 80:* 

1: uniprot_sprot:* 
2: uniprot_trembl :* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



10 Description 



Result 




Query 




No. 


Score 


Match 


Length DB 


1 


172 


96.6 


275 2 


2 


152 


85.4 


271 2 


3 


43 


24.2 


252 2 


4 


43 


24.2 


252 2 


5 


17 


9.6 


569 2 


6 


13 


7.3 


141 2 


7 


12 


6.7 


268 1 


8 


12 


6.7 


365 2 


9 


12 


6.7 


700 2 


10 


12 


6.7 


700 2 


11 


12 


6.7 


2025 2 


12 


11 


6.2 


136 2 


13 


11 


6.2 


301 2 


14 


11 


6.2 


425 2 


15 


11 


6.2 


588 2 


16 


11 


6.2 


613 2 


17 


10 


5.6 


65 2 


•18 


10 


5.6 


65 2 


19 


10 


5.6 


74 2 


20 


10 


5.6 


130 2 


21 


10 


5.6 


134 2 


22 


10 


5.6 


137 2 


23 


10 


5.6 


137 2 


24 


10 


5.6 


137 2 


25 


10 


5.6 


138 2 


26 


10 


- 5.6 


. 151 1 


27 


10 


5.6 


158 2 


28 


10 


5.6 


185 1 


29 


10 


5.6 


221 2 



Q7Z7K6_HUMAN 
Q8NFH6_HUMAN 
Q5SX2L_MOUSE 
Q9CXS4_MOUSE 
Q4SLH6_TETNG 
Q6N8J6_RHOPA 
LBD13_ARATH 
Q4KLX2_XENLA 
Q8S976_ORYSA 
Q653H7_ORYSA 
Q99PP2_MOUSE 
Q9CV31_M0USE 
Q6L4DLORYSA 
Q8ND23_HUMAN 
Q5XHYLRAT 
Q6ZLDl_ORYSA 
Q8QTCl_WSSV 
Q8VAQ0_WSSV 
Q59EF8_HUMAN 
Q4JlV7_AZOVI 
Q5DGR5_SCHJA 
Q69U86_ORYSA 
Q4JMU1_9BACT 
Q7NFCl_GLOVI 
Q7QE51_ANOGA 
YWIS_WHEAT 
Q9DVW0_9BACU 
N020_SOYBN 
049946_SOLTU 
Page 1 



Q7z7k6 

Q8nfh6 

Q5sx21 

Q9cxs4 

Q4slh6 

Q6n8j6 

Q9at61 

Q4klx2 

Q8s976 

Q653h7 

Q99pp2 

Q9cvjl 

Q6l4dl 

Q8nd23 

QSxhyl 

Q6zldl 

Q8qtcl 

Q8vaq0 

Q59e?8 

Q4jlv7 

Q5dgr5 

Q69u86 

Q4jmul 

Q7nfcl 

Q7qe51 

P14723 

Q9dvw0 

P08960 

049946 



homo sapien 
homo sapien 
mus musculu 
mus musculu 
tetraodon n 
rhodopseudo 
arabidopsis 
xenopus lae 
oryza sativ 
oryza sativ 
mus musculu 
mus musculu 
oryza sativ 
homo sapien 
rattus norv 
oryza sativ 
white spot 
white spot 
homo sapien 
azotobacter 
schistosoma 
oryza sativ 
uncultured 
gloeobacter 
anopheles g 
triticum ae 
plutella xy 
glycine max 
soianum tub 




Untitled 

GenCore version 5.1.6 
Copyright (c) 1993 - 2006 Compugen Ltd. 

OM protein - nucleic search, using f rame_plus_p2n model 

Run on: January 14, 2006, 00:57:07 ; search time 3092 Seconds 

(without alignments) 
3217.210 Million cell updates/sec 

US-10-530-886-2 
175 

1 MRRSRSSAAAKLRGQKRSGA FEVWASADLHI FDCKYRNYI 175 



Title: 

Perfect score: 
Sequence: 

scoring table: 



OLIGO 

Xgapop 60.0 
Ygapop 60.0 
Fgapop 6.0 
Delop 6.0 



Xgapext 60.0 
Ygapext 60.0 
Fgapext 7.0 
Delext 7.0 



Searched: 
Word size: 



5883141 seqs, 28421725653 residues 
1 



11757827 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Listing first 75 summaries 

command line parameters: 
-MODEL=f rame+_p2n. model -DEV=xlp 

-Q=/cgn2_l/USPTO_spool_p/USl0530886/runat_12012006_070942_6072/app_query.fasta_1.654 

-DB=GenEmbl -QFMT=fastap -SUFFIX=ol igo . rge -MINMATCH=0. 1 -LOOPCL-0 -LOOPEXT=0 
-UNITS=bitS -START=1 -END=-1 -MATRlX=ol i go -TRANS=human40 . cdi -LIST=75 
-DOCALIGN=200 -THR_SCORE=qual i ty -THR_MIN=1 -ALIGN=50 -MODE=LOCAL -OUTFMT=ptO 
-N0RM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 

-USER=USl0530886_OCGN - .l_l_7724_Orunat_12012006_070942_6072 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 
-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=60 -XGAPEXT=60 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP=60 -YGAPEXT=60 -DELOP=6 -DELEXT=7 



Database : 



GenEmbl 



1 

2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 



gbj 

gb_ 
gb_< 
gb_< 
gb_i 
gb_i 
gb_i 

gb_ 

gb 



_ba:* 
.in:* 
.env: * 
.om:* 
.ov:* 
.pat: * 
.ph:* 
.pr : * 
_ro:* 
>_sts:* 
>_sy:* 
>_un: * 
>_vi : * 
>_htg:* 
)_pl : * 



Pred. No. is the number of results predicted by chance to have a 
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Untitled 

score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 
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NO. 


Score 


Query 
Matcn 


Length DB 


ID 


Description 
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6 


CS034884 


CS034884 Sequence 




2 


168 


96.0 


1097 


6 


CS034886 


CS034886 Sequence 




3 


168 


96.0 


1097 


6 


CS043836 


CS043836 Sequence 




4 


168 


96.0 


1097 


6 


CS043838 


CS043838 Sequence 




5 


168 


96.0 


1097 


8 


AF514992 


AF5 14992 Homo sapi 




6 


152 


86.9 


1191 


8 


BC052604 


BC052604 Homo sapi 




7 


136 


77. 7 


161943 


8 


HS84M10 


AL356460 Homo sapi 




8 


136 


77. 7 


192307 


8 


AC005971 


AC005971 Homo saDi 


c 


9 


119 


68.0 


170944 


8 


AC093484 


AC093484 Homo sapi 




10 


109 


62. 3 


926 


8 


HSA332538 


AJ332538 Homo sapi 




11 


84 


48.0 


1797 


6 


CS036951 


CS036951 Sequence 




12 


84 


48.0 


1797 


6 


CS045903 


CS045903 Sequence 




13 


72 


41. 1 


1005 


6 


AR447632 


AR447632 Sequence 




14 


50 


28.6 


579 


6 


CQ724012 


CQ724012 Sequence 




15 


45 


25.7 


723 


8 


HSA330089 


AJ330089 Homo sapi 


c 


16 


33 


18.9 


181719 


14 


AL662863 


AL662863 Mus muscu 




17 


33 


18.9 


195339 


9 


AL596181 


AL596181 Mouse DNA 




18 


33 


18.9 


322497 


14 


AC112109 


AC112109 Rattus no 




19 


30 


17. 1 


300 


6 


BD220196 

\m+ U 4— 4— \J W V/ 


BD220196 Human aen 




20 


27 


15.4 


147951 


14 


AC143662 


AC143662 Macaca mu 


c 


21 


19 


10.9 


193129 


5 


AL953908 


AL953908 Zebrafish 




22 


19 


10.9 


206722 


5 


CR848813 


CR848813 Zebrafish 


c 


23 


17 


9.7 


2602 


2 


AK112512 


AK112512 ciona int 




24 


13 


7.4 


572 


5 


CR389739 


CR389739 Gallus ga 




25 


13 


7.4 


744 


5 


CR352613 


CR352613 Gallus ga 




26 


13 


7.4 


110001 


14 


AC151687 


AC151687 Gallus ga 




27 


13 


7.4 


349142 


1 


BX572599 


BX572599 Rhodopseu 


c 


28 


12 


6.9 


224 


15 


AY022843 


AY022843 Oryza sat 


c 


29 


12 


6.9 


560 


10 


BV223329 


BV223329 S233P6545 




30 


12 


6.9 


644 


15 


AY429384 


AY429384 Arabidops 




31 


12 


6.9 


741 


15 


AF345338 


AF345338 Arabidops 




32 


12 


6.9 


911 


15 


AF447888 


AF447888 Arabidops 




33 


12 


6.9 


1203 


15 


AB091677 


AB091677 Peri pi oca 




34 


12 


6.9 


2100 


15 


AB071299 


AB071299 oryza sat 




35 


12 


6.9 


2510 


5 


BC098960 

fc* V* \J *J V/ mS \J V 


BC098960 Xenopus 1 




36 


12 


6.9 


3026 


15 


AK100322 


AK100322 Orvza sat 




37 


12 


6.9 


6297 


9 


AF227194 


AF227194 Mus muscu 


c 


38 


12 


6.9 


62850 


14 
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AC100009 Mus muscu 




39 


12 


6.9 


63451 


8 


AL662886 
*» i— \j \j i— \j \j \j 


AL662886 Human DNA 


c 


40 


12 


6 [ 9 


79272 


15 


AC002338 


AC002338 Arabidoos 




41 


12 


6*9 


110000 


15 


AP008212 281 


Continuation (2R2 




42 


12 


6.9 


110000 


15 


AP008212_282 


Continuation (283 




43 


12 


6.9 


135378 


8 


AL390060 


AL390060 Human DNA 




44 


12 


6.9 


143589 


9 


AC124745 


AC124745 Mus muscu 




45 


12 


6.9 


147009 


14 


AC084179 


AC084179 Homo sapi 




46 


12 


6.9 


150502 


9 


AC127563 


AC127563 Mus muscu 




47 


12 


6.9 


156517 


15 


AP005395 


AP005395 oryza sat 




48 


12 


6.9 


157110 


8 


AL929101 


AL929101 Human DNA 




49 


12 


6.9 


160562 


9 


AC164299 


AC164299 Mus muscu 




50 


12 


6.9 


161012 


14 


AP004277 


AP004277 Oryza sat 


c 


51 


12 


6.9 


170871 


14 


AC084135 


AC084135 Homo sapi 




52 


12 


6.9 


175754 


15 


AP003766 


AP003766 Oryza sat 




53 


12 


6.9 


182841 


5 


BX248326 


BX248326 Zebrafish 




54 


12 


6.9 


185174 


9 


AC136004 


AC136004 Mus muscu 




55 


12 


6.9 


185321 


8 


AC123908 


AC123908 Homo sapi 



Page 2 



Untitled 



c 


56 


12 


6. 


9 


188633 


9 


AL606932 


AL606932 Mouse DNA 


c 


57 


12 


6. 


9 


193411 


14 


BX813304 


BX813304 Danio rer 


c 


58 


12 


6. 


9 


200568 


14 


AC093318 


AC093318 Mus muscu 




59 


12 


6. 


9 


201389 


14 


AC093358 


AC093358 Mus muscu 


c 


60 


12 


6. 


9 


202932 


9 


AC153890 


AC153890 Mus muscu 




61 


12 


6. 


9 


204134 


9 


AC145116 


AC145116 Mus muscu 


c 


62 


12 


6. 


9 


207105 


14 


AC123332 


AC123332 Rattus no 


c 


63 


12 


6. 


9 


210145 


5 


CR387996 


CR387996 Zebrafish 


c 


64 


12 


6. 


9 


210969 


9 


AC139758 


AC139758 Mus muscu 




65 


12 


6. 


9 


216290 


14 


AC151275 


AC151275 Mus muscu 




66 


12 


6. 


9 


224054 


14 


AC097089 


AC097089 Rattus no 




67 


12 


6. 


9 


230871 


14 


AC162361 


AC162361 Mus muscu 


c 


68 


12 


6. 


9 


230951 


14 


AC120683 


AC120683 Rattus no 




69 


12 


6. 


9 


235703 


9 


AC113970 


AC113970 Mus muscu 


c 


70 


12 


6. 


9 


244853 


14 


AC103346 


AC103346 Rattus no 


c 


71 


12 


6. 


9 


253961 


14 


AC128235 


AC128235 Rattus no 


c 


72 


12 


6. 


9 


254581 


14 


AC111674 


AC111674 Rattus no 


c 


73 


12 


6. 


9 


256496 


9 


AC158239 


AC158239 Mus muscu 




74 


12 


6. 


9 


265797 


9 


AC099884 


AC099884 Mus muscu 




75 


12 


6. 


9 


271069 


14 


AC127915 


AC127915 Rattus no 



ALIGNMENTS 



RESULT 1 
CS034884 
LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

FEATURES 

source 



ORIGIN 



CS034884 1097 bp DNA 

Sequence 4390 from Patent WO2005016962 . 
CS034884 

CS034884.1 Gl:60733237 



linear PAT 10-MAR-2005 



Homo sapiens (human) 
Homo sapiens 
Eukaryota; Metazoa; 
Mammal i a ; Eut he ri a ; 
Hominidae; Homo. 
1 

Abbas, A. , Clark.H., Ouyang,W 
Compositions and methods for 
diseases 

Patent: WO 2005016962-A 4390 24-FEB-2005 
Genentech, Inc. (US) 

Locati on/Qual i f i e rs 

1. .1097 

/orqanism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon:9606" 



chordata; craniata; Vertebrata; Euteleostomi 
Euarchontogli res; Primates; catarrhini ; 



, Wi Hi ams , M . P. , Wood.W. 
the treatment of immune 



I. and Wu,T. 
related 



Alignment scores: 
Pred. No. : 
Score: 

Percent Similarity: 
Best Local Similarity: 
Query Match: 
DB: 



2.72e-142 


Length : 


1097 


168.00 


Matches : 


168 


100.00% 


Conservative: 


0 


100.00% 


Mismatches: 


0 


96.00% 


Indels: 


0 


6 


Gaps: 


0 



US-10-530-886-2 (1-175) x CS034884 (1-1097) 

Qy 

Db 



2 ArgArgserArgSerserAlaAlaAlaLysLeuArgGlyGlnLysArgserGlyAlaSer 21 

IITIITIMIITIIIIIIMIIMIIIIIIIIMITIIIIIIIIIIITIIIIIIIIIIII 

2 CGGCGATCGAGGAGCTCTGCGGCCGCCAAGCTGCGCGGGCAGAAGCGGTCCGGGGCCTCC 61 
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62 



QV 22 

Db 62 

Qy 42 
Db 

Qy 

Db 

Qy 82 

Db 242 

Qy 102 

Db 302 

Qy 

Db 

Qy 

Db 

Qy 162 
Db 482 



Untitled 

Al aAl aProAl aAl aSerAl aAl aAl aAl aLeuAl aProSerAl aThrArgThrArgArg 41 
M 1 1 1 1 1 1 1 1 I INI 1 1 1 1 1 1 II! Ml II 111 I II 1 1 II I 1 1 1 1 1 1 1 1 1 III 1 1 1 111 IT 

GCGGCCCCCGCGGCCTCCGCGGCCGCTGCCTTGGCACCCAGCGCCACCCGCACACGGCGC 121 



SerAl aSerGl nAl aGlySerLysSerGl nAl aval Gl uLysProProSerGl uLysPro 61 
1 1 1 1 1 1 1 1 II II 1 1 1 1 II 1 1 1 1 1 1 Mill I MM I III I II I 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 

12 2 TCCGCTAGCCAGGCCGGGAGCAAGAGCCAGGCGGTGGAGAAGCCGCCGTCGGAGAAGCCG 18 1 



81 



ArgLeuArgArgSerSerProArgAl aGl nGl uGl uGl yProGlyGl uProProProPro 

MTMMITmTMMMMMITMMMMMMMMMIMMIMMMMMM 
182 cggctgaggcgctcgtcgccgcgggcccaggaggagggcccgggggagccgccgccgcct 241 



Gl uLeuAlaLeuLeuProProProProProProProProThrProAl aThrProThrSer 
I M 1 1 II 1 1 1 MMM II M I II I M I M I M M 1 1 M 1 1 MMMMMMIMMM! 

GAGCTGGCGTTGCTCCCGCCACCGCCGCCGCCGCCGCCGACTCCCGCGACCCCGACGTCC 



101 
301 
121 



SerAl aSerAsnLeuAspLeuGl yGl uGl nArgGl uArgTrpGl uThrPheGl nLysArg 
I M I M I M 1 1 M M I II M I M MM IM II II M I 111 II II M I II I II II 1 1 1 1 II 

TCGGCGTCCAACCTGGACCTGGGCGAGCAGCGGGAGCGCTGGGAGACGTTCCAGAAGCGG 361 



122 



Gl nl_ysl_euThrSerGl uGl yAl aAl aLysLeuLeuLeuAspThrPheGl uTyrGl nGly 141 
I II II M I M M I II I II II I II II II II II II II II II II 1 1 M I II I II MMM II I 

362 CAGAAGCTTACCTCCGAGGGTGCCGCCAAGCTCCTGCTAGACACCTTTGAATACCAGGGC 42 1 



142 



161 



LeuValLysHisThrGlyGlyCysHisCysGlyAl aValArgPheGluValTrpAl aSer 
I II II II M I II I II II I II I Ml II II M M II II II II MM I II I II M II II M II 

422 CTGGTGAAGCACACAGGAGGCTGCCACTGTGGAGCAGTTCGTTTTGAAGTTTGGGCCTCA 481 

AlaAspLeuHisllePheAspCys 169 
I II M II II M II M Ml II I II I 

GCAGACTTGCATATATTTGACTGC 505 



RESULT 2 
CS034886 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

FEATURES 

source 



ORIGIN 



CS034886 1097 bp DNA 

Sequence 4392 from Patent WO2005016962 . 
CS034886 

CS034886.1 Gl:60733238 



linear PAT 10-MAR-2005 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi 
Mammalia; Eutheria; Euarchontogli res ; Primates; Catarrhini; 
Hominidae; Homo. 



Abbas, A., Clark, H., Ouyang,W 

for 



Compositions and methods 
diseases 

Patent: WO 2005016962-A 4392 24-FEB-2005; 
Genentech, Inc. (us) 

Locati on/Qual i f i ers 

1. .1097 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon:9606" 



, Williams, M. P. , Wood,W.I. and wu.T.D. 
the treatment of immune related 



Alignment Scores: 

Pred. No.: 2.72e-142 

Score: 168.00 

Percent similarity: 100.00% 

Best Local Similarity: 100.00% 

Query Match: 96.00% 



Length: 
Matches: 
conservative: 
Mismatches: 
Indels : 
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1097 

168 

0 

0 

0 



DB: 



Untitled 
Gaps: 



US-10-530-886-2 (1-175) x CS034886 (1-1097) 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

RESULT 3 
CS043836 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

FEATURES 

source 



2 ArgArgSerArgSerSerAlaAlaAlaLysLeuArgGlyGlnLysArgSerGlyAlaSer 21 
MIMIMMMMMMMMMMMMMMMMMMMMMMMMMMM 

2 CGGCGATCGAGGAGCTCTGCGGCCGCCAAGCTGCGCGGGCAGAAGCGGTCCGGGGCCTCC 61 

22 AlaAlaProAlaAlaSerAlaAlaAlaAlaLeuAlaProSerAlaThrArgThrArgArq 41 
M I M M I I M I I I I M I I I M II I I I I II I I I IN I I I I I I M I I I Mill M I llM I 

62 GCGGCCCCCGCGGCCTCCGCGGCCGCTGCCTTGGCACCCAGCGCCACCCGCACACGGCGC 12 1 

42 SerAl aSerGl nAl aGlySerLysSerGl nAl aValGl uLysProProSerGl uLysPro 61 
I 1 1 1 1 1 1 1 1 1 I II 1 1 1 II I II I II 1 1 1 1 1 1 1 1 MM 1 1 1 1 1 1 1 1 1 II 1 1! 1 1 1! 1 1 1 1 1 1 

12 2 TCCGCTAGCCAGGCCGGGAGCAAGAGCCAGGCGGTGGAGAAGCCGCCGTCGGAGAAGCCG 18 1 

62 ArgLeuArgArgSerSerProArgAlaGlnGluGl uGlyProGlyGluProProProPro 81 
Ml 1 1 1 1 111 MM I MM II I Ml 1 1 1 1 1 1 1 1 M M 1 1 II I M II 1 1 II 1 1 1 II I II 1 1 

182 CGGCTGAGGCGCTCGTCGCCGCGGGCCCAGGAGGAGGGCCCGGGGGAGCCGCCGCCGCCT 241 



82 GluLeuAlaLeuLeuProProProProProProProProThrProAlaThrProThrSer 
MMMMMMMMMMMMMMIMMMMMMMM MMMMMMMI 

242 GAGCTGGCGTTGCTCCCGCCACCGCCGCCGCCGCCGCCGACTCCCGCGACCCCGACGTCC 



101 
301 
121 



102 SerAl aSerAsnLeuAspLeuGlyGl uGl nArgGl uArgTrpGl uThrPheGl nLysArg 
MMM MM MM MM I II 1 1 1 1 II II I II 1 II 1 1 II 1 1 1 II I MMMMMMMI 

302 TCGGCGTCCAACCTGGACCTGGGCGAGCAGCGGGAGCGCTGGGAGACGTTCCAGAAGCGG 361 



122 GlnLysLeuThrSerGl uGlyAl aAlaLysLeuLeuLeuAspThrPheGl uTyrGlnGly 141 
I 1 1 1 1 I 1 1 1 1 I M 1 1 1 1 M I II I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMMMIIMM III III 

362 CAGAAGCTTACCTCCGAGGGTGCCGCCAAGCTCCTGCTAGACACCTTTGAATACCAGGGC 42 1 



142 



161 



Leuval LysHi sThrGl yGl yCysHi sCysGl yAl aval ArgPheGl uValTrpAl aser 
1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II I M M I M I M M 1 1 1 M I M M 

42 2 CTGGTGAAGCACACAGGAGGCTGCCACTGTGGAGCAGTTCGTTTTGAAGTTTGGGCCTCA 481 



162 AlaAspLeuHisllePheAspCys 169 
MMMIMMMMMMMMI 

482 GCAGACTTGCATATATTTGACTGC 505 



CS043836 1097 bp DNA 

Sequence 4390 from Patent WO2005019258. 
CS043836 

CS043836.1 Gl:61850672 



linear PAT 22-MAR-2005 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontogli res ; Primates; catarrhini; 
Hominidae; Homo. 
1 

Abbas, A., Clark, H., ouyang,w., Williams, P.M. , Wood, W.I. and Wu,T.D. 
compositions and methods for the treatment of immune related 
diseases 

Patent: WO 2005019258-A 4390 03-MAR-2005; 
Genentech, Inc. (us) 

Locati on/Qual i f i ers 

1. .1097 

/organism= ,, Homo sapiens" 
/mol„type= n unassigned DNA" 
/db_xref="taxon:9606" 
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ORIGIN 



Untitled 



Alignment Scores: 

Pred. No.: 2.72e-142 

Score: 168.00 

Percent Similarity: 100.00% 
Best Local Similarity: 100.00% 

Query Match: 96.00% 

DB: 6 



Length: 

Matches: 

Conservative: 

Mismatches: 

Indels: 

Gaps: 



1097 

168 

0 

0 

0 

0 



US-10-530-886-2 (1-175) x CS043836 (1-1097) 



Qy 

Db 

QY 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

RESULT 4 
CS043838 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 



2 ArgArgSerArqSerSerAlaAlaAlaLysLeuArgGlyGlnLysArgSerGlyAlaSer 21 

I III III I I I II I II I I I I I I II I I I I I I I I I I I I II I I I I I I I I I II I II I I I I ^ 
2 CGGCGATCGAGGAGCTCTGCGGCCGCCAAGCTGCGCGGGCAGAAGCGGTCCGGGGCCTCC 61 

22 Al aAl aProAl aAl aSerAl aAl aAl aAl aLeuAl aProSerAl aThrArgThrArgArg 41 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I M 1 1 1 II I M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 I II 1 1 1 1 111 II 

62 GCGGCCCCCGCGGCCTCCGCGGCCGCTGCCTTGGCACCCAGCGCCACCCGCACACGGCGC 12 1 

42 SerAlaSerGlnAlaGlySerLysSerGlnAl aval GluLysProProSerGluLys Pro 61 
I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I II I I I I I I I I I I I I II I I I I I I I I I 
12 2 TCCGCTAGCCAGGCCGGGAGCAAGAGCCAGGCGGTGGAGAAGCCGCCGTCGGAGAAGCCG 18 1 



62 



81 



ArgLeuArgArgSerSerProArgAl aGlnGl uGluGlyProGlyGl uProProProPro 
I II 1 1 M 111 I1M I 1 1 1 I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 I 

182 CGGCTGAGGCGCTCGTCGCCGCGGGCCCAGGAGGAGGGCCCGGGGGAGCCGCCGCCGCCT 241 



101 
301 
121 



82 GluLeuAlaLeuLeuProProProProProProProProThrProAlaThrProThrSer 
1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 I I 1 1 Ml I I I 1 1 1 1 I I 1 1 1 1 1 1 I I I I 1 1 1 1 I i II 1 1 ! I II 

242 GAGCTGGCGTTGCTCCCGCCACCGCCGCCGCCGCCGCCGACTCCCGCGACCCCGACGTCC 

102 SerAl aSerAsnLeuAspLeuGl yGl uGl nArgGl uArgTrpGl uThrPheGl nLysArg 
1 1 1 1 M I I M 1 1 1 1 1 I II M 1 1 1 1 1 1 1 1 1 1 1 III I M 111 I M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 

302 TCGGCGTCCAACCTGGACCTGGGCGAGCAGCGGGAGCGCTGGGAGACGTTCCAGAAGCGG 361 

122 Gl nLysLeuThrSerGl uGl yAl aAl aLysLeuLeuLeuAspThrPheGl uTyrGl nGl y 141 

I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I M II I I I I I I I I I I I I I I I I I 
362 CAGAAGCTTACCTCCGAGGGTGCCGCCAAGCTCCTGCTAGACACCTTTGAATACCAGGGC 421 

142 LeuValLysHisThrGl yGl yCysHisCysGl yAl aval ArgPheGluValTrpAlaSer 161 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 1 I I I I I I I I I I I I I I I I I I 
42 2 CTGGTGAAGCACACAGGAGGCTGCCACTGTGGAGCAGTTCGTTTTGAAGTTTGGGCCTCA 481 



162 AlaAspLeuHisIlePheAspCys 169 

I I I I I I I I I I I I I I I I I I I I I I I I 
482 GCAGACTTGCATATATTTGACTGC 505 



CS043838 1097 bp DNA 

Sequence 4392 from Patent WO2005019258. 
CS043838 

CS043838.1 Gl:61850673 



linear PAT 22-MAR-2005 



Homo sapiens (human) 
Homo sapiens 
Eukaryota; Metazoa; 
Mammal i a ; Eutheri a ; 
Homi ni dae ; Homo . 
1 

Abbas, A., Clark, H. f Ouyang,W 
Compositions and methods for 



chordata; Craniata; Vertebrata; Euteleostomi 
Euarchontoglires; Primates; Catarrhini; 



, Williams, P.M. , Wood,W. 
the treatment of immune 



I. and Wu,T.D. 
related 



Page 6 



JOURNAL 

FEATURES 

source 



ORIGIN 



Untitled 

diseases 

Patent: WO 2005019258-A 4392 03-MAR-2005; 
Genentech, Inc. (us) 

Locati on/Qual i f i e rs 

1. .1097 

/orqanism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon:9606" 



Alignment Scores: 

Pred. No.: 2.72e-142 

Score: 168.00 

Percent similarity: 100.00% 
Best Local Similarity: 100.00% 

Query Match: 96.00% 

DB: 6 



Length: 
Matches : 
Conservative: 
Mismatches: 
Indels: 
Gaps : 



1097 

168 

0 

0 

0 

0 



US-10-530-886-2 (1-175) x CS043838 (1-1097) 



Qy 

Db 

Qy 

Db 

Qy 

Db 
Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

D 



2 ArgArgSerArgSerSerAlaAlaAlaLysLeuArgGlyGlnLysArgSerGlyAlaSer 21 
I Ml 111 I I I 111 I I I I I I I I I I I I I I I I I I I I I 111 I I I I I I I I I 111 I I I I I I 1 I I I I 

2 CGGCGATCGAGGAGCTCTGCGGCCGCCAAGCTGCGCGGGCAGAAGCGGTCCGGGGCCTCC 61 
22 Al aAl aProAl aAl aSerAl aAl aAl aAl aLeuAl aProSerAl aThrArgThrArgArg 41 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 III IT 

62 GCGGCCCCCGCGGCCTCCGCGGCCGCTGCCTTGGCACCCAGCGCCACCCGCACACGGCGC 12 1 



42 



82 
242 
102 
302 



162 
482 




61 



serAl aSerGl nAl aGl ySerLysSerGl nAl aval Gl uLysProProSerGl uLysPro 
IIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIII 

12 2 TCCGCTAGCCAGGCCGGGAGCAAGAGCCAGGCGGTGGAGAAGCCGCCGTCGGAGAAGCCG 18 1 
62 ArgLeuArgArgSerSerProArgAlaGlnGl uGluGlyProGlyGl uProProProPro 81 

llTlllllTlllllllllllMlTlllllltlllllllllllllllllllllllllllll 

182 CGGCTGAGGCGCTCGTCGCCGCGGGCCCAGGAGGAGGGCCCGGGGGAGCCGCCGCCGCCT 241 



GluLeuAl aLeuLeuProProProProProProProProThrProAlaThrProThrSer 
llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

GAGCTGGCGTTGCTCCCGCCACCGCCGCCGCCGCCGCCGACTCCCGCGACCCCGACGTCC 



101 
301 
121 



SerAl aSerAsnLeuAspLeuGlyGl uGl nArgGl uArgTrpGl uThrPheGl nLysArg 

llllllllllllllllllll MM llllllllTlllMll IIIIIIIIIIMIIII III! 

TCGGCGTCCAACCTGGACCTGGGCGAGCAGCGGGAGCGCTGGGAGACGTTCCAGAAGCGG 361 



122 Gl nLysLeuThrserGl uGl yAl aAl aLysLeuLeuLeuAspThrPheGl uTyrGl nGly 141 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
362 CAGAAGCTTACCTCCGAGGGTGCCGCCAAGCTCCTGCTAGACACCTTTGAATACCAGGGC 42 1 

142 LeuvalLysHisThrGlyGlyCysHisCysGlyAlaValArgPheGluValTrpAlaSer 161 
M MM MM MMMM MM M M MIMMMIM M MMIMMM MMI MM 

422 CTGGTGAAGCACACAGGAGGCTGCCACTGTGGAGCAGTTCGTTTTGAAGTTTGGGCCTCA 481 



AlaAspLeuHisllePheAspcys 169 
llllllllllllllllllllllll 

GCAGACTTGCATATATTTGACTGC 505 



AF514992 
Homo sapiens 
AF514992 
AF5 14992.1 Gl 



nuclear 



1097 
protein 



bp 
p30 



mRNA linear 
mRNA, partial cds. 



PRI 16-SEP-2002 



21717631 
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SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



Untitled 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontogli res; Primates; Catarrhini; 
Hominidae; Homo. 

1 (bases 1 to 1097) 

Cronshaw, J . M . , Krutchinsky,A.N. , Zhang, w. , Chait,B.T. and 
Matuni s , M. J . 

Proteomic analysis of the mammalian nuclear pore complex 

3 . Cell Biol. 158 (5), 915-927 (2002) 

12196509 

2 (bases 1 to 1097) 
Cronshaw, J . M . and Matunis , M. J . 
Direct Submission 

Submitted (23-MAY-2002) Biochemistry and Molecular Biology, Johns 
Hopkins Bloomberg School of Public Health, 615 N. Wolfe St., 
Baltimore, MD 21205, USA 

Locati on/Qual i f i ers 

1. .1097 

/orqanism="Homo sapiens" 
/mo T_type="mRNA" 
/db_xref="taxon:9606" 
/clone="IMAGE: 3936128" 
<1. .817 

/note="enriched at the nuclear periphery" 
/codon_start=2 

/product="nuclear protein p30" 
/protei n_i d="AAM76703 . 1" 
/db_xref="Gl: 21717632" 

/translation="RRSRSSAAAKLRGQKRSGASAAPAASAAAALAPSATRTRRSASQ 
AGSKSQAVEKPPSEKPRLRRSSPRAQEEGPGEPPPPELALLPPPPPPPPTPATPTSSA 
SNLDLGEQRERWETFQKRQKLTSEGAAKLLLDTFEYQGLVKHTGGCHCGAVRFEVWAS 
ADLHIFDCNCSICKKKQNRHFIVPASRFKLLKGAEHITTYTFNTHKAQHTFCKRCGVQ 
SFYTPRSNPGGFGIAPHCLDEGTVRSMVTEEFNGSDWEKAMKEHKTIKNMSKE" 



ORIGIN 



Alignment Scores: 
Pred. No. : 
Score: 

Percent similarity: 
Best Local similarity: 
Query Match: 
DB: 



2.72e-142 


Length: 


1097 


168.00 


Matches: 


168 


100.00% 


Conservative: 


0 


100.00% 


Mismatches: 


0 


96.00% 


Indels: 


0 


8 


Gaps: 


0 



US-10-530-886-2 (1-175) x AF514992 (1-1097) 



QY 
Db 



2 ArgArgSerArgSerSerAlaAlaAlaLysLeuArgGlyGl nLysArgSerGlyAlaSer 21 

lllllTlll llflllllMIIIIIMIIIIIIMIIII IIMMIIIlllllll MINI 

2 CGGCGATCGAGGAGCTCTGCGGCCGCCAAGCTGCGCGGGCAGAAGCGGTCCGGGGCCTCC 61 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



22 
62 
42 



Al aAl aProAl aAl aSerAl aAl aAl aAl aLeuAl aProSerAl aThrArgThrArgArg 
MINIM! IIIIIIIIMIIIIIIIIIIIIIIIIIII IIMMIIIlllllll Mllll 

GCGGCCCCCGCGGCCTCCGCGGCCGCTGCCTTGGCACCCAGCGCCACCCGCACACGGCGC 



41 
121 



SerAl aSerGl nAl aGlySerLysSerGl nAl aValGl uLysProProSerGl uLysPro 61 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
122 TCCGCTAGCCAGGCCGGGAGCAAGAGCCAGGCGGTGGAGAAGCCGCCGTCGGAGAAGCCG 181 

62 ArgLeuArgArgSerSerProArgAl aGl nGl uGl uGl yProGlyGl uProProProPro 81 

iMIIMlilMllllllllllllllllllllllI III MMMMMIIMM Mllll 

182 CGGCTGAGGCGCTCGTCGCCGCGGGCCCAGGAGGAGGGCCCGGGGGAGCCGCCGCCGCCT 241 

82 Gl uLeuAl aLeuLeuProProProProProProProProThrProAl aThrProThrSer 101 

Page 8 



Db 


242 


Qy 


102 


Db 


302 


Qy 


122 


Db 


362 


Qy 


142 


Db 


422 


Qy 


162 


Db 


482 



Untitled 

I I I I III I I I I II I I I I I I I I I llllll III 111 I I I I I I I I I I I I I I I I I I I I I I I I I I 
GAGCTGGCGTTGCTCCCGCCACCGCCGCCGCCGCCGCCGACTCCCGCGACCCCGACGTCC 301 

SerAl aSerAsnLeuAspLeuGl yGl uGl nArgGl uArgTrpGl uThrPheGl nLysArg 121 
I I I ; I I 1 I I I I III M I M I M MM M III III I M Ml I I I M I I I I I I I I I I I I I M 

TCGGCGTCCAACCTGGACCTGGGCGAGCAGCGGGAGCGCTGGGAGACGTTCCAGAAGCGG 361 

Gl nLysLeuThrSerGl uGl yAl aAl aLysLeuLeuLeuAspThrPheGl uTyrGl nGl y 141 
I I I I II I I I I I I I I I I I I I I I I III I II I II II I I I I I I I I I I I I I I I I I I I II I I I I I I 

CAGAAGCTTACCTCCGAGGGTGCCGCCAAGCTCCTGCTAGACACCTTTGAATACCAGGGC 42 1 

LeuVal LysHi sTh rGl yGl yCysHi sCysGl yAl aval Arg PheGl uVal TrpAl ase r 161 
I I I I I II I I I I II I I I I I I I I I I II I Mill I I I I I I I I I III I I I I I I I I I I I M I I I I 
CTGGTGAAGCACACAGGAGGCTGCCACTGTGGAGCAGTTCGTTTTGAAGTTTGGGCCTCA 481 

AlaAspLeuHisllePheAspCys 169 
MM III II Mill I III llllll 
GCAGACTTGCATATATTTGACTGC 505 
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Untitled 
/ersion 5 

Copyright (c) 1993 - 2006 Compugen Ltd. 



Gencore version 5.1.6 Q^O^i^ *fc> 7 



OM protein - protein search, using sw model 

Run on: January 12, 2006, 12:44:40 ; Search time 116.986 Seconds 

(without alignments) 
1073.498 Million cell updates/sec 

Title: US-10-530-886-4 
Perfect score: 178 

Sequence: 1 MRRSRSSAAAKLRGQKRSGA FEVWASADLHIFDCKYRNYI 178 

scoring table: OLIGO 

Gapop 60.0 , Gapext 60.0 

Searched: 2166443 seqs, 705528306 residues 

Word size : 0 

Total number of hits satisfying chosen parameters: 2166443 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Listing first 75 summaries 

Database : UniProt_05.80:* 

1: uniprot^sprot :* 
2 : uni prot_trembl : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. score Match Length DB ID Description 



1 


172 


96. 


6 


275 


2 


Q7Z7K6_HUMAN 


Q7z7k6 


homo sapien 


2 


152 


85. 


4 


271 


2 


Q8NFH6_HUMAN 


Q8nfh6 


homo sapien 


3 


43 


24. 


2 


252 


2 


Q5SX21_MOUSE 


Q5sx21 


mus musculu 


4 


43 


24. 


2 


252 


2 


Q9CXS4_MOUSE 


Q9cxs4 


mus musculu 


5 


17 


9. 


6 


569 


2 


Q4SLH6_TETNG 


Q4slh6 


tetraodon n 


6 


13 


7. 


3 


141 


2 


Q6N8J6_RHOPA 


Q6n8j6 


rhodopseudo 


7 


12 


6. 


7 


268 


1 


LBDl3_ARATH 


Q9at61 


arabidopsis 


8 


12 


6. 


7 


365 


2 


Q4KLX2_XENLA 


Q4klx2 


xenopus lae 


9 


12 


6. 


7 


700 


2 


Q8S976_ORYSA 


Q8s976 


oryza sativ 


10 


12 


6. 


7 


700 


2 


Q653H7_ORYSA 


Q653h7 


oryza sativ 


11 


12 


6. 


7 


2025 


2 


Q99PP2_MOUSE 


Q99pp2 


mus musculu 


12 


11 


6. 


2 


136 


2 


Q9CV3LMOUSE 


Q9cvjl 


mus musculu 


13 


11 


6. 


2 


301 


2 


Q6L4Dl_ORYSA 


Q614dl 


oryza sativ 


14 


11 


6. 


2 


425 


2 


Q8ND23_HUMAN 


Q8nd23 


homo sapien 


15 


11 


6. 


2 


588 


2 


Q5XHY1_RAT 


Q5xhyl 


rattus norv 


16 


11 


6. 


2 


613 


2 


Q6ZLDl_ORYSA 


Q6zldl 


oryza sativ 


17 


10 


5. 


6 


65 


2 


Q8QTCl_WSSV 


Q8qtcl 


white spot 


18 


10 


5. 


6 


65 


2 


Q8VAQ0 WSSV 


Q8vaq0 


white spot 


19 


10 


5. 


6 


74 


2 


Q59EF8_HUMAN 


Q59ef8 


homo sapien 


20 


10 


5. 


6 


130 


2 


Q4J1V7_AZ0VI 


Q4 j lv7 


azotobacter 


21 


10 


5. 


6 


134 


2 


Q5DGR5_SCHJA 


Q5dgr5 


schistosoma 


22 


10 


5. 


6 


137 


2 


Q69U86_ORYSA 


Q69u86 


oryza sativ 


23 


10 


5. 


6 


137 


2 


Q4JMUL.9BACT 


Q4jmul 


uncultured 


24 


10 


5. 


6 


137 


2 


Q7NFCl_GLOVI 


Q7nfcl 


gloeobacter 


25 


10 


5. 


6 


138 


2 


Q7QE5LANOGA 


Q7qe51 


anopheles g 


26 


10 


5. 


6 


151 


1 


YWIS_WHEAT 


P14723 


triticum ae 


27 


10 


5. 


6 


158 


2 


Q9DVW0_9BACU 


Q9dvw0 


plutella xy 


28 


10 


5. 


6 


185 


1 


NO20_SOYBN 


P08960 


glycine max 


29 


10 


5. 


6 


221 


2 


049946_SOLTU 
Page 1 


049946 


solanum tub 



Untitled 



3 A 
3U 


1U 


c 
3 


C 

. 0 






AQAKIi 3 ADVCA 


i^of n 1 z 


OryZa Sal 1 V 


3 1 


1 a 
1U 


c 

b 


. b 


3 ^ A 
Z30 


L 


n7v\/C7 r\D\/c a 
Q/ XVb /_OKY5A 


A7vi/C 7 


oryza. saiiv 


3 3 

51 


i a 
IU 


b 


. b 


3 3. A 
Z30 


~\ 
L 


Q54Z / b_HPV4/ 


Q04Z / b 


numdn pap 1 1 


3 3 

55 


i A 
XU 


c 
3 


. 0 


LoL 


X 


Afuni ui IMAM 


DRftft/lA 
r 300H-U 


nomo 5api en 


3/1 
34 


1 a 
XU 


c 
3 


. 0 


3 

ZOO 


9 


aC3maA yckitd 
v^3Z|VlA*f AtN 1 K 


A C9maA 
Li3ZfTla*t 


vononi ic *^ v*r\ 

xenopus iro 


33 


i a 
1U 


r 
b 


. b 


37*3 
Ll 5 




3 i£ D Y 0 .JJ K Y S A 


A £ -7ri\/G 
^3ZDyO 


OryZa 5al.IV 


3 C 
3D 


1 A 

IU 


b 


. b 


3 70 

z / y 


L 


QbCoQ0_yADcN 


A C r- ft n A 


si mi an aaen 


3 7 
3/ 


1 A 

IU 


b 


. b 


3 C3 


Z 


aOddi iA Mrti ice 
QodRU0_M0U5E 


aQU n iA 

i^oD ruu 


rnus muscu i u 


"3 Q 
3D 


1 A 
IU 


b 


. b 


TOO 

£03 




QbtrAZ^OKYbA 


^oepaz 


oryza sdii v 


3y 


i a 


r 
3 


. 0 


3QA 

zy4 




r\Q.W\AA ui imam 
^orMX^^f^nUlviAN 


Aftnl Ail 


norno 5ap i en 


/I A 

4U 


1U 


r 

b 


. b 


3 AA 
30U 


z 


Q4T HHb_TcTNG 


AVI4.UU C. 

144 tnno 


tetraoaon n 


41 


IU 


b 


. b 


3 A/1 
304 


X 


Vt4_HPV4/ 


D3 3/131 

PZZ4ZX 


numan pctpi i 


A 3 
4Z 


1 A 
1U 


r 

b 


. b 


3 1 Q 
3I0 


-) 
L 


Qb 3 JGZ_0KY5A 


AC 3 -» #-i3 

QbJjgz 


oryza sativ 


A 3 

45 


i a 
ID 


b 


. b 


3 31 
33l 


L 


QbP0GZ_BKARE 


Qbpogz 


brachydani o 


44 


1 A 
IU 


b 


. b 


ore 

3bb 


L 


r\ O 1 TnO f\ n \/C" A 

QoLXRo_ORY5A 


Qo 1 1 To 


oryza sativ 


43 


1 A 
IU 


b 


. b 


5/5 


X 


CD 1 Tl UI IMAM 

r bLXX HUMAN 


f~\Q\*ti 1 r-i 3 

LioWUpZ 


nomo sapi en 


a a 
45 


10 


r 

b 


. b 


3 AC 

iyb 


I 


nCk IT r~ C \_l\iKAAK\ 

QbNTFb_HUMAN 


Qbntrb 


homo sapi en 


A 7 
4/ 


1 A 
10 


b 


. b 


3 QQ 

3yo 


X 


5rb_M0U5E 


r\Ci-l Uw3 

Qyi nxz 


mus muscu i u 


4o 


1 A 
10 


p 
b 


. b 


3 QQ 

3yo 


z 


aCDCD/1 u I imam 

QbDtD4_HUMAN 


QbDeD4 


nomo sapi en 


A Ck 

4y 


1 A 
10 


b 


. b 


A 1 3 
41Z 


z 


QbCArb_ORYbA 


QbCarb 


oryza sativ 


C A 

50 


1 A 
10 


b 


. b 


>1 7 1 

4^1 


X 


ACRO_HUIv|AN 


Dl A33 3 

PXO3Z 5 


homo sapi en 


3l 


1 A 
10 


b 


. b 


A3 1 
4^1 


L 


n^Tri/") LillMAM 

Q0XCKZ_HU|V1AN 


aCi r~ lx 3 

v^O 1 CKZ 


nomo sapi en 


C 3 

3Z 


"1 A 
10 


b 


. b 


/I 3 7 
4Z / 


L 


QbETLZ__0RY5A 


Qbe u 1 z 


oryza sanv 


C "3 
33 


"1 A 
10 


b 


. b 


A C 7 
4b / 


~y 
L 


aC7|/I/^ DAT 

QbZKK4_RAT 


QbZKK4 


rai-LUS norv 


34 


1 A 
10 


b 


. b 


4bl 


~) 
L 


Qb B J ZX_BRARE 


QbDj Z X 


Dracnyaam o 


c c 

33 


1 A 
10 


b 


. b 


4bo 


-5 
L 


QbPy4X_BRARE 


Qbpy4X 


Dracnyaam o 


bb 


1 A 
10 


b 


. b 


A73 
4/Z 


L 


/"\ C I I A A 3 VCMI A 

QbU4A3 — XENLA 


libU4a3 


xenopus lae 


r 7 

3/ 


1 A 
10 


b 


. b 


4/b 


L 


aHC rT7 An ATU 

Qy SCT /_ARATH 


Qysc t/ 


araoi aopsi s 


bo 


1 A 
10 


b 


. b 


4/b 


Z 


QbZKAb_CnXCK 


A r "7l/aR 

v43ZKaO 


ga i i us ga i i 


by 


1 A 
10 


r 

b 


. b 


4/0 


Z 


Q b XT L b _M AGG R 


am j_t c 

Qbx l 1 b 


magnapo r tne 


bU 


1 A 
10 


b 


. b 


4©y 


Z 


aC^yt7 uiimam 


Aft £ vf 7 


nomo sapi en 


bl 


1 A 
10 


b 


. b 


4yb 


Z 


ACrCrA vadi t 
v40LoGU_YARLX 


A^r-ftn A 


yarrowi a i l 


£3 

bZ 


1 A 
10 


b 


. b 


AQ7 

4y / 


Z 


143 H-LWZ_UXLUX 


a^z11\a/3 

t^3 *T 1 WZ 


□i c uyos Lc 1 1 


a 3 
03 


1 A 
10 


b 


. b 


CIO 

biy 


Z 


r\ A O C C C /~\ D \/C A 

04obb b„0RY5A 


A/lftAC C 

U4obb b 


oryza sativ 


C A 

o4 


1 A 
10 


p 
b 


. b 


C A C 

b4b 


z 


QbY5Xo — URY5A 


Qbysxo 


oryza sanv 


ob 


1 A 
10 


b 


. b 


bob 


z 


aCdimC DUX/DA 

QbRJ Nb_PHYPA 


Q6r] n6 


pnyscomi tre 


00 


1 A 
10 


b 


. b 


c 03 

JVC. 


z 


rtCniM7 DUVDA 

QbRjN/_PHYPA 


Q6r3 n7 


pnyscomi ure 


£7 
0/ 


1 A 
10 


b 


. b 


blo 


Z 


aQc^i/Q advca 
QobbKy_URYbA 




oryza sauiv 


05 


1 A 
10 


b 


. b 


blo 


Z 


AQQl/\/C MAI ICC 

Qyyiwo_Muubb 


AQQ Lrv/ft 

uiyy Kvo 


miip mi ic n il n 

mus muscu i u 


C A 

by 


1 A 
10 


b 


. b 


bbb 


Z 


Q/TVGZ_MYCdO 


LJ/ Lvy Z 


mycoDac te n 


/u 


1 A 
10 


b 


. b 


bbb 


z 


nC Q 7 A A MvrTi 1 
Uby / 4U_MYCTU 


AA07AA 
U0y / 4U 


mycoDacte n 


71 


in 




■ u 




3 
c 


v^O r\ V C «y Vy l\ I JrA 




opvza sativ 

ui y i—\x jul i v 


72 


10 


5 


.6 


670 


2 


Q593S5_CYPCA 


Q593s5 


cyprinus ca 


73 


10 


5 


.6 


672 


2 


Q8LMH2_ORYSA 


Q81mh2 


oryza sativ 


74 


10 


5 


.6 


687 


2 


Q8BKD3_MOUSE 


Q8bkd3 


mus musculu 


75 


10 


5 


.6 


698 


2 


Q7XF73_ORYSA 


Q7xf73 


oryza sativ 



ALIGNMENTS 



RESULT 1 
Q7Z7K6_HUMAN 

ID Q7Z7K6_HUMAN preliminary; prt; 275 AA. 

AC Q7Z7K6; 

DT Ol-OCT-2003 (TrEMBLrel . 25, Created) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-MAR-2004 (TrEMBLrel. 26, Last annotation update) 

DE Hypothetical protein. 

GN Name=PRR6; 

OS Homo sapiens (Human). 

oc Eukaryota; Metazoa; chordata; craniata; Vertebrata; Euteleostomi ; 

oc Mammalia; Eutheria; Euarchontogli res; Primates; catarrhini; Hominidae; 

oc Homo. 

OX NCBl_TaxlD=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Blood; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10. 1073/pnas .242603899; 

RA strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., shenmen CM., Schuler G.D., 
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Untitled 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H. ( Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K. , Farmer A. A. , Rubin G.M., Hong L., 

RA stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C. , 

RA Raha s.s., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S. , worley K.C., Hale S. f Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M. , Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA whiting M. , Madan A., Young A.C., shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E. D. v Dickson M.c, 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., skalska U. f Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Blood; 

RA Director MGC Project; 

RL Submitted (MAY-2003) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC052604; AAH52604.1; -; mRNA. 

DR Ensembl ; ENSG00000166582 ; Homo sapiens. 

DR HGNC; HGNC: 29920; PRR6. 

DR GO; GO:0016846; F: carbon-sulfur lyase activity; IEA. 

DR GO; GO:0008152; P:metabolism; IEA. 

DR interPro; IPR006913; GFA. 

DR Pfam; PF04828; DUF636; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 275 AA; 29946 MW; E9A69F1F9BA746D8 CRC64; 

Query Match 96.6%; Score 172; DB 2; Length 275; 

Best Local Similarity 100.0%; Pred. No. 3.6e-141; 

Matches 172; Conservative 0; Mismatches 0; Indels 0; Gaps 0 
Qy 1 MRRSRSSAAAKLRGQKRSGASGASAAPAASAAAALAPSATRTRRSASQAGSKSQAVEKPP 60 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 

Db 1 MRRSRSSAAAKLRGQKRSGASGASAAPAASAAAALAPSATRTRRSASQAGSKSQAVEKPP 60 

Qy 61 SEKPRLRRSSPRAQEEGPGEPPPPELALLPPPPPPPPTPATPTSSASNLDLGEQRERWET 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 61 SEKPRLRRSSPRAQEEGPGEPPPPELALLPPPPPPPPTPATPTSSASNLDLGEQRERWET 120 

Qy 121 FQKRQKLTSEGAAKLLLDTFEYQGLVKHTGGCHCGAVRFEVWASADLHIFDC 172 

I I I I I I I I I I I II I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 FQKRQKLTSEGAAKLLLDTFEYQGLVKHTGGCHCGAVRFEVWASADLHIFDC 172 
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